Automated Discourse Segmentation by Syntactic Information and Cue Phrases

نویسندگان

  • Huong Le Thanh
  • Geetha Abeysinghe
  • Christian Huyck
چکیده

This paper presents an approach to automatic segmentation of text written in English into Elementary Discourse Units (EDUs) using syntactic information and cue phrases. The system takes documents with syntactic information as the input and generates EDUs as well as their nucleus/satellite roles. The experiment shows that this approach gives promising results in comparison with some of the prominent research relevant to our approach.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Automated Video Segmentation for Lecture Videos: A Linguistics-Based Approach

Video, a rich information source, is commonly used for capturing and sharing knowledge in learning systems. However, the unstructured and linear features of video introduce difficulties for end users in accessing the knowledge captured in videos. To extract the knowledge structures hidden in a lengthy, multi-topic lecture video and thus make it easily accessible, we need to first segment the vi...

متن کامل

Beyond String Matching and Cue Phrases: Improving Efficiency and Coverage in Discourse Analysis

RASTA (Rhetorical Structure Theory Analyzer), a discourse analysis component within the Microsoft English Grammar, efficiently computes representations of the structure of written discourse using information available in syntactic and logical form analyses. RASTA heuristically scores the rhetorical relations that it hypothesizes, using those scores to guide it in producing more plausible discou...

متن کامل

Now let's Talk about Now; Identifying Cue Phrases Intonationally

Cue phrases are words and phrases such as now and by the way which may be used to convey explicit information about the structure of a discourse. However, while cue phrases may convey discourse structure, each may also be used to different effect. The question of how speakers and hearers distinguish between such uses of cue phrases has not been addressed in discourse studies to date. Based on a...

متن کامل

Discourse Segmentation by Human and Automated Means

The need to model the relation between discourse structure and linguistic features of utterances is almost universally acknowledged in the literature on discourse. However, there is only weak consensus on what the units of discourse structure are, or the criteria for recognizing and generating them. We present quantitative results of a two-part study using a corpus of spontaneous, narrative mon...

متن کامل

Topic Segmentation of Web Documents with Automatic Cue Phrase Identification and BLSTM-CNN

Topic segmentation plays an important role for discourse analysis and document understanding. Previous work mainly focus on unsupervised method for topic segmentation. In this paper, we propose to use bidirectional long shortterm memory(BLSTM) model, along with convolutional neural network(CNN) for learning paragraph representation. Besides, we present a novel algorithm based on frequent subseq...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2003